Recycling Lingware in a Multilingual MT System

نویسندگان

  • Manny Rayner
  • David M. Carter
  • Ivan Bretan
  • Robert Eklund
  • Mats Wirén
  • Steffen Leo Hansen
  • Sabine Kirchmeier-Andersen
  • Christina Philp
  • Finn Sorensen
  • Hanne Erdman Thomsen
چکیده

We describe two methods relevant to multilingual machine translation systems, which can be used to port linguistic data (grammars, lexicons and transfer rules) between systems used for processing related languages. The methods are fully implemented within the Spoken Language Translator system, and were used to create versions of the system for two new language pairs using only a month of expert effort.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Automatic Testing and Evaluation of Multilingual Language Technology Resources and Components

German Research Center for Artificial Intelligence (DFKI), Language Technology Lab Stuhlsatzenhausweg 3, D-66123 Saarbrücken, Germany email: {ulrich.schaefer,daniel.beck}@dfki.de Abstract We describe SProUTomat, a tool for daily building, testing and evaluating a complex general-purpose multilingual natural language text processor including its linguistic resources (lingware). Software and ling...

متن کامل

JTaCo & SProUTomat: Automatic Evaluation and Testing of Multilingual Language Technology Resources and Components

We describe JTaCo, a tool for automatic evaluation of language technology components against annotated corpora, and SProUTomat, a tool for building, testing and evaluating a complex general-purpose multilingual natural language text processor including its linguistic resources (lingware). The JTaCo tool can be used to define mappings between the markup of an annotated corpus and the markup prod...

متن کامل

Generation for Multilingual MT

This paper presents an overview of the broad-coverage, application-independent natural language generation component of the NLP system being developed at Microsoft Research. It demonstrates how this component functions within a multilingual Machine Translation system (MSR-MT), using the languages that we are currently working on (English, Spanish, Japanese, and Chinese). Section 1 provides a sy...

متن کامل

On Portability of Resources for a Quick Ramp up of Multilingual MT of Patent Claims

We describe a feasibility study on reusing the components of the unilingual authoring application AutoPat in a full-scale multilingual MT system APTrans, and explore to which extent MT knowledge can be ported from one language to another in the patent domain. We illustrate our findings on the example of English, Danish and

متن کامل

Two Principles and Six Techniques for Rapid Mt Development

In this paper we describe a range of techniques used at NMSU CRL for accelerating the development of MT systems. These techniques enable semi-automatic development of a number of components of a multilingual MT system, thereby enabling rapid deployment of MT capabilities in a new language. First, we describe the core multi-engine, multilingual architecture that enables the different techniques ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • CoRR

دوره cmp-lg/9705007  شماره 

صفحات  -

تاریخ انتشار 1997